The Emergency Transcriber
Abstract
This thesis presents a novel situation-awareness tool for sensing classification. We propose a general sensing scheme and apply it to build an acoustic tool for teams of first responders and emergency personnel. The tool provides an audio interface for reliably recording and disseminating situation progress, as extracted from the team's audio communications. It is intended for emergency teams operating in noisy acoustic environments, where standalone speech recognition systems fail to deliver the desired accuracy. Such teams typically follow a predefined collaborative workflow, dictated by the relevant engagement protocols, that specifies their roles and communications. Given the critical nature of the situation, the vocabulary used is often constrained and depends on the stage of the workflow currently being executed. We treat a traditional speech recognition component as a noisy sensor; the novelty of our tool lies in exploiting knowledge of the workflow to correct its noisy measurements. The intellectual contribution is the joint estimation of the current workflow state together with the correction of the sensed data, given only the noisy speech measurements and an overall workflow description. Evaluation shows that the tool provides a significant accuracy improvement over standalone speech recognition, effectively coping with the noisy environments in which emergency teams operate.
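The joint estimation described in the abstract can be illustrated with a small sketch. This is not the thesis's algorithm, which the abstract does not specify. It assumes the workflow can be modeled as a left-to-right hidden Markov model decoded with the Viterbi algorithm, and it uses plain string similarity as a stand-in for an acoustic confusion model. The stage names, vocabularies, the `STAY` probability, and the function names are all hypothetical.

```python
import math
from difflib import SequenceMatcher

# Hypothetical three-stage workflow; each stage permits a small vocabulary.
WORKFLOW = ["assess", "treat", "transport"]
VOCAB = {
    "assess": ["pulse", "breathing", "airway"],
    "treat": ["oxygen", "bandage", "splint"],
    "transport": ["stretcher", "ambulance", "hospital"],
}
STAY = 0.6  # assumed probability of remaining in the current stage


def log_transition(i, j):
    """Left-to-right workflow: stay in stage i or advance to stage i + 1."""
    last = len(WORKFLOW) - 1
    if j == i:
        return 0.0 if i == last else math.log(STAY)
    if j == i + 1:
        return math.log(1.0 - STAY)
    return -math.inf


def best_word(stage, heard):
    """Return (log score, word) for the stage word most similar to `heard`.

    String similarity is an assumed stand-in for a real confusion model.
    """
    scored = [(SequenceMatcher(None, heard, w).ratio(), w) for w in VOCAB[stage]]
    sim, word = max(scored)
    return math.log(sim + 1e-6), word


def decode(heard_words):
    """Viterbi decoding: jointly pick a stage and a corrected word per step."""
    if not heard_words:
        return []
    n = len(WORKFLOW)
    emissions = [[best_word(s, h) for s in WORKFLOW] for h in heard_words]
    # Assumption: the team always starts in the first stage.
    score = [emissions[0][j][0] if j == 0 else -math.inf for j in range(n)]
    back = []
    for t in range(1, len(heard_words)):
        new_score, pointers = [], []
        for j in range(n):
            prev = max(range(n), key=lambda i: score[i] + log_transition(i, j))
            new_score.append(score[prev] + log_transition(prev, j) + emissions[t][j][0])
            pointers.append(prev)
        score = new_score
        back.append(pointers)
    # Trace back from the best final stage to recover the full stage path.
    state = max(range(n), key=lambda j: score[j])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return [(WORKFLOW[s], emissions[t][s][1]) for t, s in enumerate(path)]


if __name__ == "__main__":
    for stage, word in decode(["pulse", "oxigen", "bandage", "strecher"]):
        print(stage, word)
```

With these toy settings, the misrecognized inputs "oxigen" and "strecher" are mapped to "oxygen" and "stretcher", and the decoded stage sequence advances from assess through treat to transport. The point of the sketch is only that the stage estimate narrows the candidate vocabulary while the corrected words in turn inform the stage estimate.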
Similar resources
On Exploiting Structured Human Interactions to Enhance Sensing Accuracy in Cyber-physical Systems
In this article, we describe a general methodology for enhancing sensing accuracy in cyber-physical systems that involve structured human interactions in noisy physical environments. We define structured human interactions as domain-specific workflows. A novel workflow-aware sensing model is proposed to jointly correct unreliable sensor data and keep track of states in a workflow. We also propose...
An Auditory Model Based Transcriber of Vocal Queries
In this paper a new auditory model-based transcriber of vocal melodic queries is presented. Our experiments show that the new system can transcribe queries with an accuracy between 76 % (whistling) and 85 % (singing with syllables), and that it outperforms four state-of-the-art systems it was compared with.
An analysis of transcription consistency in spontaneous speech from the Buckeye corpus
We present a preliminary analysis of transcriber consistency in labeling and segmentation of words and phones in the Buckeye corpus of spontaneous, informal speech. We find that pairwise inter-transcriber agreement on exact phone label match was 76%, and segmentation agreement within 20% of phone pair length was 75%, though longer phones are more consistently segmented than shorter phones. Patt...
Transcribing against time
We investigate the problem of manually correcting errors from an automatic speech transcript in a cost-sensitive fashion. This is done by specifying a fixed time budget, and then automatically choosing location and size of segments for correction such that the number of corrected errors is maximized. The core components, as suggested by previous research [1], are a utility model that estimates ...
Transcriber: Development and use of a tool for assisting speech corpora production
We present "Transcriber", a tool for assisting in the creation of speech corpora, and describe some aspects of its development and use. Transcriber was designed for the manual segmentation and transcription of long duration broadcast news recordings, including annotation of speech turns, topics and acoustic conditions. It is highly portable, relying on the scripting language Tcl/Tk with exten...